How a Part of the Brain Might or Might Not Work: A New Hierarchical Model of Object Recognition

نویسندگان

  • Maximilian Riesenhuber
  • Tomaso Poggio
  • Earl K. Miller
چکیده

The classical model of visual processing in cortex is a hierarchy of increasingly sophisticated representations, extending in a natural way the model of simple to complex cells of Hubel and Wiesel. Somewhat surprisingly, little quantitative modeling has been done in the last 15 years to explore the biological feasibility of this class of models to explain higher level visual processing, such as object recognition in cluttered scenes. We describe a new hierarchical model, HMAX, that accounts well for this complex visual task, is consistent with several recent physiological experiments in inferotemporal cortex and makes testable predictions. Key to achieve invariance and robustness to clutter is a MAX-like response function of some model neurons which selects (an approximation to) the maximum activity over all the afferents, with interesting connections to “scanning” operations used in recent computer vision algorithms. We then turn to the question of object recognition in natural (“continuous”) object classes, such as faces, which recent physiological experiments have suggested are represented by a sparse distributed population code. We performed two psychophysical experiments in which subjects were trained to perform subordinate level discrimination in a continuous object class — images of computer-rendered cars — created using a 3D morphing system. By comparing the recognition performance of trained and untrained subjects we could estimate the effects of viewpoint-specific training and infer properties of the object class-specific representation learned as a result of training. We then compared the experimental findings to simulations in HMAX, to investigate the computational properties of a population-based object class representation. We find experimental evidence, supported by modeling results, that training builds a viewpointand class-specific representation that supplements a pre-existing representation with lower shape discriminability but greater viewpoint invariance. Finally, we show how HMAX can be extended in a straightforward fashion to perform object categorization and to support arbitrary class hierarchies. We demonstrate the capability of our scheme, called “Categorical Basis Functions” (CBF), with the example domain of cat/dog categorization, and apply it to study some recent findings in categorical perception. Thesis Supervisor: Tomaso Poggio Title: Uncas and Helen Whitaker Professor

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Modeling storage and retrieval of memories in the brain

We have proposed a neural network model that stores the incoming information after orthogonalizing it in the same manner as vectors are orthogonalized. The scheme enables the brain to compare a new informational system with those in the memory and store its similarities and differences with the old memories in an economical manner. This allows the brain to have an enormous capacity and yet the ...

متن کامل

Mental Arithmetic Task Recognition Using Effective Connectivity and Hierarchical Feature Selection From EEG Signals

Introduction: Mental arithmetic analysis based on Electroencephalogram (EEG) signal for monitoring the state of the user’s brain functioning can be helpful for understanding some psychological disorders such as attention deficit hyperactivity disorder, autism spectrum disorder, or dyscalculia where the difficulty in learning or understanding the arithmetic exists. Most mental arithmetic recogni...

متن کامل

Modeling storage and retrieval of memories in the brain

We have proposed a neural network model that stores the incoming information after orthogonalizing it in the same manner as vectors are orthogonalized. The scheme enables the brain to compare a new informational system with those in the memory and store its similarities and differences with the old memories in an economical manner. This allows the brain to have an enormous capacity and yet the ...

متن کامل

P14: How to Find a Talent?

Talents may be artistic or technical, mental or physical, personal or social. You can be a talented introvert or a talented extrovert. Learning to look for your talents in the right places and building those talents into skills and abilities might take some work, but going about it creatively will let you explore your natural abilities and find your innate talents. You’re not going to fin...

متن کامل

Automated Tumor Segmentation Based on Hidden Markov Classifier using Singular Value Decomposition Feature Extraction in Brain MR images

ntroduction: Diagnosing brain tumor is not always easy for doctors, and existence of an assistant that                                                      facilitates the interpretation process is an asset in the clinic. Computer vision techniques are devised to aid the clinic in detecting tumors based on a database of tumor c...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2000